Using KCCA for Japanese-English cross-language information retrieval and classification
نویسندگان
چکیده
Kernel Canonical Correlation Analysis (KCCA) is a method of correlating linear relationship between two multidimensional variables in feature space. We applied the KCCA to the Japanese-English cross-language information retrieval and classification. The results were encouraging.
منابع مشابه
Advanced learning algorithms for cross-language patent retrieval and classification
We study several machine learning algorithms for cross-language patent retrieval and classification. In comparison with most of other studies involving machine learning for cross-language information retrieval, which basically used learning techniques for monolingual sub-tasks, our learning algorithms exploit the bilingual training documents and learn a semantic representation from them. We stu...
متن کاملQuery Translation from Indonesian to Japanese Using English as Pivot Language
In this paper, we propose a query translation method for Cross Lingual Information Retrieval (CLIR) system which works for Japanese (target language) documents with Indonesian (source language) queries. Because Indonesian-Japanese is an unfamiliar language pair, it is difficult to translate Indonesian queries to Japanese directly. Therefore, we use English as a pivot language for transitive tra...
متن کاملComparing Multiple Methods for Japanese and Japanese-English Text Retrieval
The NACSIS collection of Japanese scienti c documents (with English titles) provides a solid foundation for information retrieval research into 1) segmentation methods for Japanese text, 2) e ective methods for monolingual Japanese retrieval, and 3) JapaneseEnglish cross-language retrieval. This paper compares multiple methods for Japanese and Japanese-English text retrieval. Our focus is on ac...
متن کاملReliable Measures for Aligning Japanese-English News Articles and Sentences
We have aligned Japanese and English news articles and sentences to make a large parallel corpus. We first used a method based on cross-language information retrieval (CLIR) to align the Japanese and English articles and then used a method based on dynamic programming (DP) matching to align the Japanese and English sentences in these articles. However, the results included many incorrect alignm...
متن کاملJapanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration
Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper proposes a Japanese/English CLIR system, where we combine a query translation and retrieval modules. We currently target the retrieval of technical documents, and therefore the performance of our sy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999